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Abstract 

We study stochastic copying schemes in which discrimination between a right and a 
wrong match is achieved via different kinetic barriers or different binding energy of the 
two matches. We demonstrate that, in single-step reactions, the two discrimination 
mechanisms are strictly alternative and can not be mixed to further reduce the error 
fraction. Close to the lowest error limit, kinetic discrimination results in a diverging 
copying velocity and dissipation per copied bit. On the opposite, energetic discrimina- 
tion reaches its lowest error limit in an adiabatic regime where dissipation and velocity 
vanish. By analyzing experimentally measured kinetic rates of two DNA polymerases, 
T7 and P0I7, we argue that one of them operates in the kinetic and the other in 
the energetic regime. Finally, we show how the two mechanisms can be combined in 
copying schemes implementing error correction through a proofreading pathway. 

Living organisms need to process signals in a fast and reliable way. Copying 
information is a task of particular relevance, as it is required for the replication 
of the genetic code, the transcription of DNA into mRNA, and its translation 
into a protein. Reliability is fundamental, since errors can result in the costly (or 
harmful) production of a non-functional protein. Indeed, cells have developed 
mechanisms to reduce the copying error rate 77 to values as low as rj ~ 1CP 4 
for protein transcription-translation pQ and 77 ~ 10~ 10 for DNA replication [2J. 
Such mechanisms include multiple discrimination steps jTJ |2] and pathways to 
undo wrong copies as in proofreading [2J HJ |5J |B] or backtracking [7] . 

Biological information is copied by thermodynamic machines that operate at 
a finite temperature. There is agreement that this fact alone implies a lower limit 
on the error rate. However, contrasting results have been obtained regarding 
the nature of this limit. In particular, it is not clear when it is reached in a 
slow and quasi-adiabiatic regime, or in a fast and dissipative one. As clarified by 
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Bennett [8], information can be copied adiabatically. Indeed, the copying scheme 
proposed in Hopfield's seminal proofreading paper [2] reaches its minimum error 
at zero velocity and zero dissipation [9]. In contrast, a copolymerization model 
proposed few years later by Bennett [TJ 111! I12| , achieves its minimum error in a 
highly dissipative regime, where velocity and dissipation diverge. Some of the 
biological literature has favoured that the minimum error is achieved in near- 
equilibrium conditions [5]. This view is however not unanimous |13) . Recent 
biophysical literature supports a dissipative minimum error limit |14( ITU fTSj fT5] . 
Similar disagreements are also present in models including proofreading. The 
proofreading model in [TJ dissipates systematically less than the corresponding 
copying, while in other models 0H], at low errors, dissipation comes mainly 
from the proofreading step. 

In this Letter, we show how these contrasting results can be rationalized 
noting that copying can be performed either discriminating through binding en- 
ergies adiabatically, energetic discrimination, or discriminating through binding 
barriers dissipatively, kinetic discrimination. We begin by presenting a model for 
copying a single bit of information in the spirit of those proposed in |51 [T51[T71[T5] . 
see Fig. Ilk. A bio-machine such as a polymerase binds and unbinds monomers 
of different species to a template, trying to match it. We then move to the case 
of copolymerization, Fig. [TJ), where a polymerase assembles a polymer chain to 
match a template strand. Finally, we discuss two proofreading schemes, Figs. 
[TJ: and [Tjl, where the polymerase is assisted by an exonuclease that tends to 
remove wrong matches. 

a 
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Fig. 1: a) Copying of one bit. The lower-vertex triangle represents a bio- 
machine, such as a polymerase, binding or unbinding right and wrong 
matches with rates k± w . b) Copolymerization. A template strand (bot- 
tom) is copied into a new strand (top) . Right (r) and wrong (w) matches 
are added and removed with rates k r J. w . c) & d) Proofreading schemes. 
The polymerase is assisted by an exonuclease, represented by an upper- 
vertex triangle and characterized by k± w . In d), copies are made via an 
intermediate state characterised by k± w . 

Stochastic copying strategies of a single bit. The copying machine is de- 
scribed as a three-states system. Two are bound states in which the right (r) 
or wrong (w) molecule is attached to the machine. The third is a "blank" state 
(0), representing the state of the machine before a matching is done. To help 
physical intuition and following [TJ, we define the rates from the free energy 
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landscape in Fig. [2^. Right and wrong matching are characterized by a differ- 
ence in barrier height 5, and in the energy of the final states 7. The energy e is 
a chemical driving. 

The four rates k± w connecting the unbound state with the right and wrong 
states can be written (see Fig. |2^t,) as: 

k r + = uc e+s ; k r _ = ujq s ; k| = we 6 ; fc™ = we 7 . (1) 

where to is an overall rate scale. The master equation for the probabilities p r 
and p w of finding the system in the right or wrong state reads 

p r = (1 — p r — p w )k+ — k r -p r (2) 

Pw = (1 - Pr - Pw)k+ - k™p w 

where p$ has been eliminated by normalization. We study the time-dependent 
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Fig. 2: a) Energy diagram for copying rates. The barrier height difference 5, 
energy difference between right and wrong states 7, and chemical driving 
e are shown, b) Time-evolution of the error in the single bit copy for 
three parameter choices: 7 < 5 (green curve, 7 = 3 and S = 5.5), 7 > S 
(blue curve, 7 = 8 and S = 5.5), and 7 = 5 = 5.5 (black curve). The 
other parameters are e = 5 and lo = 4. 

error rate rj(t) = p r (t)/[p r (t) + p w (t)] for the system prepared in the unbound 
state, p r (t = 0) = p w {t = 0) = 0. At short times, t <§; the populations 

of right and wrong states are p r ~ tk r + and p w ~ tk™. To shorten notation, 
we define the function f(x) — e~ x / (1 + e~ x ) mapping energies into errors. The 
short-time error is then rj(t — > 0) = f(S). In the opposite limit of t a;" 1 , the 
system reaches equilibrium so that rj(t — > 00) = f(j) by detailed balance. At 
intermediate times, one can demonstrate from the analytical solution of Eqs. [2] 
that rj(t) is a monotonic function for any choice of rates (see [12]): increasing 
with time when S > 7 (i.e. f(5) < f(j)), and decreasing when 7 > 5 (i.e. 
/((5) > 7(7))- For 5 = 7, the error is time-independent. Examples of the three 
cases are shown in Fig. [2]d. 

To maximize accuracy, the copying reaction must be arrested when r](t) 
is at its minimum value, quenching the system into either a right or wrong 
copy outcome. In an enzymatic reaction, this corresponds to the irreversible 
transformation of a bound state into a product |2J . In [8 , where bits are encoded 
in ferromagnets, it corresponds to the decoupling from an external transverse 
field. It is then natural to define the kinetic discrimination regime S > 7, where 
optimal accuracy requires stopping the process as fast as possible. If 7 > 5, 
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energetic discrimination regime, optimal accuracy is reached when the reaction 
reaches equilibrium, i.e. at very long time. In all cases, accuracty can not be 
improved by combining the two mechanisms, as the lower limit on the error is 
determined by either 7 or S. Notice that in an energetic discrimination scheme, 
the quench can be performed slowly, at no dissipation [5]. In a kinetic scheme, 
the quench has to be fast and dissipative. 

Kinetic and energetic discrimination in copolymer ization. In copolymeriza- 
tion, a polymerase continuously adds and removes monomers to a tip of the 
growing copy strand, trying to match them with the monomers on a template 
strand (see Fig. [T]d). The model is defined by the incorporation and removal 
rates of right k r ± and wrong k± matching monomers. As before, the rates are 
given by Eq. [TJand Fig. [2^,. The chemical driving for right and wrong bases are 
thus e and e — 7 respectively. Previous studies assumed iso-energetic strands, 
i.e. 7 = [U [HJ [TTJ [12] . We relax this assumption and study how the copying 
velocity v, and the rate of entropy production or dissipated chemical work S $2U\ 
depend on the error rate rj for a general choice of 5 and 7. It is straightforward 
to show that v = k r + — (1 — ri)k r _ + fc™ — ryfc™ |TJ [IS], and also that S is given by 

S = vAS = v(l — r/)e + vi](e — 7) + vH(r/) (3) 

where AS* = S/v is the dissipation per step, and H(r)) = —rj\og(rj) — (1 — 
77) log(l — n) is the Shannon entropy of the error rate 77. The first two terms in 
Eq. ([3| represent the distinct chemical driving forces of right and wrong bases, 
multiplied by the flux of right and wrong incorporated bases. The last term of 
Eq. (J3J) corresponds to the information entropy increase due to incorporation 
of errors, hence information, into the chain [Tl fTTl fT2] . 
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Fig. 3: a) Dissipation per step in copolymerization. In all curves 7 = 5, while 
S varies. All curves tend to zero at 77 = f(j) s» 6.7 • 10 -3 . Blue curves 
are in the energetic region, 7 > 5, while green curves are in the kinetic 
region 8 > 7. b) j-S phase diagram, showing the kinetic and energetic 
discrimination regions compatible with an error r\ ~ f(7) ~ 9 • 10 -4 , 
and estimated values of (7, S) for P0I7 and T7. Tuning 77 shifts the limit 
/ _1 (t7) of phase regions along the line 7 = S. c) Behavior of the velocity 
v for the parameter choices in a). 

By imposing steady state flux conservation, we obtain the functions AS(r]) 
and v(ji) |19|, presented in Figpk and [3t for a fixed value of 7 and different 
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values of 6. The range of physically admissible errors rj depends on 7 and 5 (see 
OH) as 

min {f (6), /( 7 )] <r,<max [/(£), /( 7 )] , (4) 

with f{x) as previously defined. We now study the dissipative limit, r\ — > f(5), 
and the adiabatic limit, 77 — > /( 7 ). 

When 77 — > /(^), the chemical driving diverges as e ~ — log |r; — /(<5)| |19j. 
Substituting into Eq. ^ shows that also AS diverges (see Fig. |3^) as 

AS~7 ? e~7 7 log|»j-/(5)| for r, -> /(*). (5) 

Since e 3> 1, the information entropy ^(77) in Eq. |3| is negligible, and dissipa- 
tion is dominated by the chemical terms. As an effect of the strong driving, the 
velocity diverges as \q — /(5)| _1 , see Fig. [3J:. 

When r\ — > /( 7 ), both v and AS tend to zero, see Fig. [3^ and|3|3: all the 
chemical energy is invested in copying the information, none being wasted. The 
chemical driving is then 

e = log(l - rj) = log[l - /( 7 )] < for r, = /( 7 ). (6) 

Note that e is small and negative, to compensate the small positive entropic 
driving caused by H{rf) in Eq. 

By inverting Eq. Q, the values of 7 and 5 compatible with a given error 
-q must satisfy either 7 < f~ l {rf) < 5 or 8 < f~ l (rj) < 7 , with f~ 1 (x) — 
log(l + the inverse of f(x). This defines the two disconnetted kinetic 
discrimination (6 > 7 ), and energetic discrimination ( 7 > i5) regions of the 
( 7 , S) plane in Fig. ^jp. 

In the kinetic region, both AS* and v diverge in the minimum error limit, 
so that accuracy comes at the cost of high dissipation. In the energetic region, 
accurate copying comes at the cost of the copying velocity, which goes to zero in 
the adiabatic minimum error limit. This fundamental difference is at the core 
of the discrepancies between enzymatic copying models [5] that assumed lack 
of forward discrimination, 6 = in our language (see |19] for mapping), and 
copolymerization studies [TJ [HJ E2] that assumed iso-energetic strands, 7 = 0. 
Our results show that it is impossible to interpolate between the two, as they 
belong to two separate phase space regions. 

Operating regimes of T7 and Pohf polymerases. We now analyze two specific 
biological copying systems: DNA replication of the phage T7 [2J[5T], and repli- 
cation of human DNA by Pol 7 |22_. A recent experimental study |22_ points 
at the strong and asymmetric backward rates as the leading discriminatory 
mechanism in T7. We derived from [22] the copolymerization rates by assum- 
ing equilibrium nucleotide binding with dissociation constants K r — and 
K w = 200/iM for right and wrong base matching. Considering nucleotide con- 
centrations in a range of [dNTP] ~ 0.5 — 50/xM we obtain the binding states 
1/(1 + K r/w /[dNTP}). Multiplying them by the forward rates (360Hz and 

0.2Hz for right and wrong bases respectively) we obtain k r J w . The backward 
rates are k r _ « 2Hz and k™ ~ 0.04Hz [22]. These values give an error range 
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r\ ~ 10~ 6 — 1CU 4 , in agreement to the estimates in [2]. Usual estimates of the 
error assume linear binding, approximation valid for low [dNTP] and yielding 
the lowest end of the error range. The velocities are v ~ 5 — 250bps (bases per 
second), in agreement with the saturation rate measured in |22| . By inverting 
Eqs.Q, we can infer 7 w 14 and S w 8. Since 7 > 5, we conclude that T7 
operates in the energetic regime (see Fig. [3|d). 

DNA duplication by P0I7 was analyzed in [TT] with a variant of the copoly- 
merization model, where different monomer species are characterised by different 
rates. Agreement with experimental data in [21 J was obtained assuming that 
the copy be iso-energetic (7 = 0). We simplify the analysis in [TT] by averaging 
over the different monomer species. Using the same driving e w 5 determined 
for T7, we obtain 6 s» 11 and a range of error rates rj ~ 10~ 5 — 10~ 3 . In the 
limit of low [dNTP] it agrees with the estimates in |21lfTT] . As 7 = 0, P0I7 lies 
in the kinetic discrimination region (Fig. [3Jd). 

The estimates above indicate that the two polymerases achieve a similar 
error rate operating in the two different regimes. They thus experience different 
trade-offs. In T7 lowering [dNTP] (effectively, the chemical driving) reduces 
the error 77 and the dissipation AS, at the cost of a smaller speed v. This 
situation is similar to that of the blue curves in Figs. [3jt, and[SJ:. In P0I7 smaller 
error requires a stronger driving, hence dissipation |11| . This gives a higher 
polymerization rate, as in the green curves of Figs. [3^ and|3j;. 

Combining copying strategies in proofreading schemes. In copolymerization, 
kinetic and energetic discrimination cannot be combined to reduce the error, Eq. 
[4j We now explore the possibility of combining the two mechanisms in multi- 
step copying schemes involving a proofreading pathway. In proofreading, an 
initially copied base can be removed via an alternative pathway, see Fig. 1 and 
[TJi. Such erasing pathway is characterized by a discrimination which, a priori, 
can be energetic j p or kinetic 8 P , a distinct time-scale l/w p , and a (backward) 
driving e p . In an effective proofreading scheme, the minimal copying error of 
Eq. [4] is reduced by an additional proofreading factor, in principle energetic 
/(7 P ) or kinetic f(S p ). We discuss two proofreading schemes. In both of them, 
the proofreading rates k r ± w have the same structure as the copying ones, apart 
from a backward driving [19]. In the first, Fig. [lp, the copying step is identical 
to that in the copolymerization model, as in Bennett's proofreading model [JJ. 
In the second, Fig. [TJi, the copying step leads to an intermediate state, taken 
to its final form via rates k r ± w without further discrimination, as in Hopfield's 
model [2] . By imposing flux balance at the steady state we solved both models 
analytically [JJ5]. We fixed the discrimination factors, and for each error 77 
minimized AS over the remaining free parameters [19J, obtaining the curves of 
minimum dissipation vs. error in Fig. [4] 

As shown in [TT|] , there are no regimes in any of the two proofreading schemes 
where the error is lowered by the energetic factor f("f p ), while error reduction by 
a kinetic proofreading factor f(8 p ) is feasible, see Fig. [2J; and[2jl. Proofreading 
is thus only effective when it operates in the kinetic regime. This result is con- 
sistent with Landauer's principle [23 as, unlike copying, proofreading involves 
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Fig. 4: a) Minimum dissipation in proofreading without an intermediate step, 
Fig. [TJ;. For both curves, 7 p = and 5 p = 5. In the case of ener- 
getic discrimination in copying and kinetic discrimination in proofread- 
ing (energetic-kinetic), the other parameters are 7 = 5, 6 = 0. In the 
kinetic-kinetic case, we used 7 = and <5 = 5. b) Minimum dissipation 
in proofreading with an intermediate step, Fig. [IJl. Parameters are as 
in a). In both panels, the expected minimum errors /(10) and /(5), 
depending on whether proofreading is effective or not, are marked. 



erasure of information (errors) and thus is intrinsically dissipative. Further, 
by looking at the minimum errors in Fig. [4] one can conclude that, while 
kinetic proofreading is always effective when combined with kinetic copying 
(green curves), it is only compatible with adiabatic copying when an interme- 
diate state is present (blue curves) . This is a key difference between the proof- 
reading schemes in [1 and [2 : without an intermediate state it is impossible to 
find a regime where copies are produced adiabatically, and undone very quickly. 
The combination of kinetic proofreading with adiabatic copying step has the 
advantage of a lower dissipation (see Fig. [Ij), green vs. blue lines). 

In this Letter, we have shown how each copying step in stochastic copying 
can be unambiguously classified into one of two radically different classes, ki- 
netic and energetic discrimination. The existence of an energetic regime in the 
copolymerization model complements the view in the literature [TJ 111! I12| that 
low copy errors are achieved only in a highly dissipative regime. It also demon- 
strates how entropy-driven growth, a phenomenon studied in the large error 
limit [TJ [HJ [TTJ [15], can be exploited to reliably copy information. Copoly- 
merization is thus compatible with the principle of reversible computing stating 
that a copy can be performed adiabatically [8 . The analysis of two DNA poly- 
merases, T7 and P0I7, shows that the first operates in the energetic regime, 
while the second in the kinetic one. Both mechanisms are thus used by bio- 
logical systems. Finally, our study of proofreading proves that the two regimes 
discussed here are combined in more complex copying schemes. 

Our conceptual framework can be applied to a wider range of problems 
related to stochastic discrimination. Examples are detection of antigens by T- 
cell receptors [24 , and discrimination of a binary input in neural dynamics |25| . 
At the sub-cellular level, thermal fluctuations dominate and impose constraints 
on biological tasks. While the thermodynamics of bio-mechanical systems such 
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as molecular motors is well understood [SB] , the role of fluctuations in biological 
information processing such as bacterial chemotaxis presents still many open 
questions [27]. Our work shows that the emerging trade-offs may be complex, 
and depend on the region in parameter space where the system operates. 

This work was partially supported by a Max Planck Society scholarship (to 
P.S.) and a Ramon y Cajal Grant (to S.P.). We are grateful to A. Bernacchia, 
J. Garcia-Ojalvo, N. Mitarai, L. Granger, M. A. Muhoz and Y. Tu for a critical 
reading of the manuscript. 
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1 Supplementary Information 

This document contains additional details and derivation of the results presented 
in the Letter "Energetic vs. kinetic discrimination in biological copying". The 
document is organized as follows. Section 1 presents a full solution of the single 
bit copying model (model A in Fig. 1 of the main text), and a demonstration 
that the error is always a monotonic function of time. Section 2 details results 
on the co-polymerization model (model B in the main text). Section 3 illustrates 
the mapping between Hopfield's and Bennett's copying schemes. Finally, section 
4 and 5 present details on the proofreading models (models C and D in the main 
text, respectively). 

1.1 Stochastic copying of a single bit 

We wish to demonstrate that, for any choice of the rates, the solution of the 
system of differential equations: 



f = fc+(l — r — w) — k r _r 

w = k™(l-r-w)-k™w. (7) 

with initial condition r(t — 0) = w(t = 0) = leads to a time dependent 
error 

/ x w(t) 
•nit) = -. r y , -. . (8 
/V ; r(t)+w{t) y ' 

being a monotone function of time for all t > 0. In particular, rj{t) either 
strictly increasing, strictly decreasing or constant depending on the choice of 
the parameters. 

The solution of the system of equations ^ can be obtained with standard 
methods. First of all, the steady state solution is: 



Weg = ~ ^ (9) 

Upon defining Sr = r — r eq and 5w = w — w eq the time-dependent distances 
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from the steady state, a lengthy but straightforward calculation leads to 
N_{-r eq + ^[ q +y/q^+4kJkf\} 

M*) = -2 



e A+t 



(10) 



+ 



8w(t) 



A_t 



—N_ (q + y/q 2 + 4k r + k™) {-r eq + ^f[q+ yj q 2 + 4k r + k™\ } e 

7V+ (g - v /^+4fcpf) {-r eg + v^+4fcpf] } e 

2^(4+^ 

where we defined the eigenvalues 

-S± VS 2 -4c 
^± = o 



A+t 



(11) 



with S = fc; + fcl + k% + k™ and c = (k r + + k r _){kl + k™) - k r + k^, and the 
quantities 



q = (k r + + k r _ - k™ - k™) 



(12) 



We now study the sign of the derivative of the error rj(t). Clearly, its sign is 
the same as the sign of the derivative of the function 



/(*) 



w(t) w eq + Sw(t) 



(13) 



r(t) r eq + 8r(t) ' 
The derivative of f(t), before any simplification, reads: 

/ = D 2k^(k w — k r ){k w k^ + k r (k w + k^))e ( 3S +\/Q)*/ 2 
{ - 2QV S+ ^*/ 2 + c^[k r _ 2 + k w _ 2 + ¥_{-2k w _ + 2k r + - 2kP 
+ {k r + + kl){k r + + kl + ^Q) + k™{-2k r + + 2k w + + y/Q)} 
+ e^ +VQ)t [k r _ 2 + k w _ 2 - k™(-2k™ + 2k r + + y/Q) 

- {k r + + kX)(-k r + -k™ + y/Q) + k r _(-2k r + + 2k w _ + 2k1 + y/Q)}} (14) 



Q) 



where S and q are defined above, and we have also defined the function of the 
rates Q = q 2 + 4k+k+. The denominator D has a complicated expression that 
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we omit since it is squared, hence it does not contribute to the sign. In the 
nominator, the term in square brackets clearly has the sign of k™ — k r _L. We now 
move to the study the sign of the term in curly brackets, which can be expressed 
in terms of hyperbolic functions as 

{} = e H^+VQ)t (_ 2 Q + 2e s 5 Q CO sh( y/Qt/2) - y/QY, sinh( y/Qt/2) ) (15) 

The prefactor is positive, and we are left with the two terms inside the paren- 
thesis. The first is clearly negative. To determine the sign of the second, par- 
ticularly the term in brackets, one should note that sinh(x) > cosh(x) for all 
positive x, and that £ > y/Q, which is shown by expanding the squares. As a 
consequence, the second term in the brackets is always larger than the first, so 
that the whole term inside the parenthesis is negative. It follows that the term 
in the curly brackets is negative, and that the sign of /' is just the sign of From 
this we conclude that the error grows monotonically for all parameter choices 
such that k r _-k™. 

Finally, the rates are parametrized in the main text by their kinetic and 
energetic discriminations (5 and 7), the driving e, and an overall time-scale uj. 
The kinetic discrimination 8 appears in the forward rates, so that fc?J_/fc™ = c s . 
The driving e is defined for right bases, so that k\jk r _ — e £ . Finally, the 
energetic discrimination 7 reduces the driving of wrong bases, k+/k™ — e e_7 . 
Summarizing, we have: 

k r + = oje e+s ; k r _ = ue s ; k™ = we 6 ; fc™ = we 7 . (16) 

The condition k r _ — k™ is then equivalent to 5 > 7, which we termed the 
case of kinetic discrimination. Conversely, when k r _ < k™ (equivalent to<5 < 7) 
the error decreases monotonically (regime of energetic discrimination). 

1.2 Co-polymerization model 

Given the four rates fc™, fc™, H, and fcH, one can easily write the two rate 
equations as: 

d t [kr] = [&]fc; - [kr]k~ 

d t [kw] = [&}k% - [kw]k- (17) 

Following Bennett's original approach [T], we consider the steady state in which 
there are constant fluxes of wrong dt[kw] — J w — 771; [&] and right dt[kr] = 
J r = (1 — additions of aminoacids into the copied strain. Under these 

assumptions, on can show [U |3] that the error is given by: 

k+ - V 11 (18) 



k r + - k r _(l -n) 1 — 17 * 
while the average copying velocity is 

v = kl-k™i 1 + k r + -k r _{l-n). (19) 
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Using the same parameterization of the previous model, Eq. (16 1, we write the 



chemical driving e as a function of the energetic discrimination energy 7, the 
kinetic discrimination energy S, and the steady state error 77. The expression 
reads 

, 1 - e<- & 

T)(l - t])- 



€ = log 



77 — (1 — rf)e 



-6 



(20) 



By means of ( 20 1 , the average velocity can be expressed as 



1- (1 + ^)77 
77 — (1 — rj)e~ s 



(21) 



We now want to impose that 1) the argument of the logarithm in ( |20| ) has to be 
positive, and 2) the average velocity (21) should also be positive. Assuming of 
course < 77 < 1, the first condition is equivalent to (S — 7) [77 — (1 + e" 5 ) -1 ] > 0, 
while the second is equivalent to [(1 — e 7 )" 1 — 77] [77 — (1 + e 5 )^ 1 ] > 0. Combining 
these two conditions leads to Eq. (4) in the main text. 
Finally, the dissipation per step is 



AS 



77 log 
77 log 



+ (l-ry) log 
+ (1 - 77) log 



1 



1-7, 
1 



1 — 77 



77 log 



+ (1 - 77) log 



+ (1-77)6 + 77(5-7), 



k r _ 



(22) 



which can be expressed as a function of 77, S and 7 only by using Eq. (20 1 



1.3 Mapping of Hopfield's original model 

In Hopfield's formulation [2j, given the template c, by interacting through C and 
D, either the aminoacid Pc or Pjj can be added to an RNA chain. Addition of 
Pc will be the right addition, and addition of Pjy will be considered an error. 
The rate equation and steady state solution are: 

c + C ^ [cC] P c and v[Cc] = k' c [C] - k c [Cc] (23) 

kc 

and analogously for D. It is assumed that [C] ~ [D], and defined fc = [Cc]/[C] 
and fu — [Dc]/[D] as the fraction of incorporated C and D monomers given a 
template c. At steady state fc = 1 — /d, and fo is the error 77 = fjj. Solving 
the system above we arrive at 

V _ !_d_ _ k' D v + k G 
1 - n fc k' c v + k D 

Identifying these rates with those in our model according to Fig. 1, the mapping 
to Hopfield's model is finished: kc = JfcH, k' c = H , kry = k™ and k' D = /«"'. 

To verify the mapping we study two limiting cases. For 7 = (as Bennett 
assumed in [I]) we have that if v — > 00, then 77/(1 — 77) — > k' D /k' c = e~ s ; and 



(24) 
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if v — > 0, then 77/(1 — 77) — > k' D kc /k' c k£> = 1. On the other hand for 5 = (as 
Hopfield assumed in [2]) we have that if t> — > 00, then 77/(1 — 77) — » k' D /k' c = 1; 
and if i> — > 0, the classical result is obtained 77/(1 — 77) — > k' D kc /k' c ki> = 
kc/kjj = e~ 7 , in exact agreement with the results obtained above. 



1.4 Proofreading model without intermediate state 

A minimal model of Kinetic Proofreading (KP) requires at least two different 
pathways. The first is the copying pathway introduced above, characterized 
by a driving which tends to make the chain grow. On the other hand, the 
driving of the second pathway is backward, thus undoing copies on average. The 
copying pathway has a bias towards adding right bases by a faster (kinetic) 
and more stable (energetic) binding. Conversely, the proofreading pathway has 
a bias towards removing wrong bases by a faster and less stable unbinding. 
The combination of both can reduce the minimal error given by the standard 
copy, by the discrimination factor of the proofreading pathway. The simplest 
proofreading scheme consists of the copying scheme introduced before, and a 
parallel reaction which we characterize by four additional proofreading rates 
k± w 



1.4.1 Rates parametrization 



We choose the same copying rates of the standard copying scheme, see Eq.( 16 1. 
Further, we introduce proofreading rates which are analogously characterized 
by a kinetic and energetic proofreading discrimination factors (5 P and 7 P ), a 
backward driving e p , and an additional time-scale lo p . In the case of proofread- 
ing, we define the driving in the backward right additions, that is k T _lk\ = e £p . 
The kinetic discrimination is also backwards, and so k™/kL — e Sp . Finally, 
the energetic discrimination is reflected in a higher backward driving of wrong 
bases, such that k™/kY = e ep+7p . One can then write the proofreading rates as 

k r _ = uj p e e "~ Sp ; k r + =Lj p e- s " ; jfe™ = w p e £p ; k^ ^ uj p e~^ . (25) 

The energy levels corresponding to this parametrization of the rates are illus- 
trated in Fig. ([5]). Notice that the end-states in the proofreading pathway have 
a difference in energy 7 — 7 P . While in some coarse grained models such a be- 
haviour may be justifiable through external agents, typically one would expect 
this difference not to exist, so that in the main text we always fixed 7 P = 7. 
Further, we anticipate that numerical results show that the proofreading step 
is always kinetic. This means that the value of j p , as soon as it is positive, will 
not anyway affect the minimum error achievable by the system. 
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KP with no intermediate state 




Fig. 5: Energy diagram of the reactions corresponding to the proofreading 
scheme with no intermediate steps. 



1.4.2 Solving the model 

The kinetic equations in this case are: 

d t [kr] = [&](fc; +k r +)- [&r](k r _ + k r _) 
d t [kw] = +kl) - [&w]{k™ + k™). (26) 

Also in this case, the steady state solution can be obtained by considering the 
fluxes of right and wrong bases added: d t [Szw] = J w = rjv[h] and right <9 t [&r] = 
J r = (1 — ?7)w[&]. The error as a function of the rates is analogous to the one 
for simple copying: 

k™ + k%- rKg + *g) = m 
k r + + k r + -(l-r))(k r _+kL) 1-v' 

The next step is to derive from this expression the driving e as a function of 
the error, the discriminations, and the two new additional parameters: the 
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proofreading driving e p and its characteristic time scale oj p . The result is: 
1 



e = log 



1 - rj(l + c 5 ) 



{77(1 - »y)(e 



(28) 



The velocity is also analogous to that of the simple copying scheme: 

v = kl + fc™ - r](k™ + k™) + k r + + ~k r + -(l- rj){k r L + k r _). (29) 

However, for the entropy production rate, one has to consider the transitions 
correponding to the two pathways independently: 



S = {k™-T}k™)l0g 



+ (k r + -(l- V )k r _ )log 
+ (k r + - (l-Jj)fcl) log 



(1 - rf)k r _. 

K 

{l-rj)k r _ 



(30) 



)} 



Finally, the dissipation per step is simply calculated as AS 1 = Syv. 



1.4.3 Minimization procedure and numerical results 

For each given value of the error r\ and the four parameters 7, j p , <5, 5 pi we iden- 
tified the values of the two remaining free parameters lo p and e p corresponding to 
the minimum dissipation per step. In order to avoid local minima, we adopted 
a sistematic minimization scheme: the two parameters have been varied with 
a logarithmic step equal to 1.04, in an interval 10~ 5 < u! p ,e p < 10 9 . In this 
region, we found the minimum dissipation per step with the constraint of a 
positive reaction velocity. We also checked a posteriori that no minimum was 
found at the boundaries of the minimization region. 

A sistematic simulation study of the 4 possibilities of energetic/kinetic copy 
coupled to energetic/kinetic proofreading is presented in Fig. The results 
allows us for reaching the following conclusions: 

• The proofreading pathway can reduce the minimum error in the kinetic 
regime only. This can be seen in the lower panels of Fig. ([6]), where 
increasing 7 P does not affect the minimum achievable error. In particular, 
in the bottom left panel the copying is energetic and the minimum error 
is given by 7(7) = e~ 7 /(l + e r ) w 0.0067 for 7 = 5. In the bottom 
right panel, the copying is kinetic and again the minimum error is given 
by f{6) ~ 0.0067 for S = 5. The minima in the two figures correspond 
to parameters such as the proofreading reactions has an average forward 
flux instead of backward, so that the proofreading pathway works as an 
effective parallel adiabatic (energetic) copying pathway. 

• cooperative error reduction only takes place when both pathways are in 
the kinetic region. In the top right panel, increasing S p does not reduce the 
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Energetic - Kinetic 



Kinetic - Kinetic 




Energetic - Energetic 



Kinetic - Energetic 




Iff* 



10 1 



1<T 



10"' 



Iff 1 



Fig. 6: Study of the four possible combination of energetic/kinetic discrimina- 
tion and energetic/kinetic proofreading in Bennett's model. In the left 
panels the copy is energetic; in particular we chose 5 = and 7 = 5. 
Conversely, in the right panels the copy is kinetic with 5 = 5 and 7 = 0. 
In the top panels the proofreading scheme is purely kinetic (jy p = 0), 
while in the bottom panel we fixed 5 P = and varied j P . 



error. The only case in which the error can be reduced is in the kinetic- 
kinetic case of the top right panel, where the minimum error is given by 
f(5)f(5 p ) « f(5 + 5 P ) 0.0067, 0.0009, 0.0001 for 5 p = 0, 2, 4 respectively. 
We remark that this feature is a peculiarity of this model. We will show 
in the next section how including an intermediate state in the copying 
pathway allows for error reduction with an energetic copy and a kinetic 
proofreading. 

1.5 Proofreading model with intermediate state 

In this section we present more extensive results on model 4 of the main text. 
This model presents some analogies with the previous one, except that copying 
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occurs via an intermediate state, denoted with a "*", which is connected with 
the final state in which the aminoacid is incorporated. This final state has also a 
proofreading step. The overall reaction scheme is more in the spirit of Hopfield's 
original proofreading mechanism. 

1.5.1 Parametrization of the rates 

The forward copying rates from the unbound to the intermediate state are de- 
fined in exactly the same way as the copying rates in the previous models, see 



Eq. (16 1. As in Hopfield's original model, the transition rates from the interme- 
diate state to the final state have no discrimination, but have their own driving 
e* and time scale given by lo* . They obey the relations k+/k r + = 1, k^/k™ — e e 
and k r + /k r _ — e e . It is not hard to show that adding a discrimination below 
that of the original copying does not reduce the error beyond the critical error. 
Adding a bigger one simply reduces it to the critical error of this secondary copy, 
unlike the additive effect of proofreading. The rates can be simply written as: 

fc; = w*e £ * ; k r _=oj* ■ jfe™ = wV* ; = uo* (31) 

The final state is then connected with the initial state by the same proof- 



reading rates defined in the previous section, Eq. (25 1. The full energy diagram 
is depicted in Fig. (j7l. As before, the energy difference 7 — j p is irrelevant as 
the proofreading step has to be a kinetic step, and so we choose it arbitrarily 
to be null. Again, this corresponds to the physical requirement that the energy 
of the chain can not change if no base is added. 



KP with one intermediate state 




Fig. 7: Energy diagram of the reactions corresponding to the proofreading 
scheme with an intermediate step. 
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1.5.2 Solving the model 



With the notation introduced in the previous section, it is easy to write the four 
kinetic equations of this proofreading scheme: 



dt[R] = [R*]k+ + [k]k r + - [R](kL + k r _) 

d t [W] = [W*]k™ + [k]k™ - [W]{k™ + jfe™) 

d t [R*} = [k]k r + + [R]k r _-[R*](k r _+k r + ) 

d t [W*] = [&]Jfe™ + [W]k m - [W*]{k™ + jfe™). 



(32) 



The easiest way to obtain the solution is by flux balance at the steady state of 
constant growth velocity v, which corresponds to: 

[W]v = ([&]jfe™ - [W]k™) + {[W*]k™ - [W]k™) 
[&]fc™ - [W*]k™ = [W*]k% - [W]k™ 

[R]v = ([k]k r + - [R]k r _) + {[R*]k+ - [R]k r _) 
[&] k\ - [R* ]k r _ = [R*]k r + -[R]k r _. (33) 

As before, we seek equations to determine the error rate and the velocity as a 
function of the rates. We proceed by dividing each of the equations in section 
by k and define W/k = r/, R/k = (1 - rj), W*/k = w*, R*/k = r* . By means 
of the 2nd and 4th equations we find an expression for r* and w*: 



w = 



k™ + r]k™ 

k r + + (1 - r})k r _ 
¥ +k r + 



(34) 



Substituting into the other 2 equations lead to two coupled equations for rj and 
v. 



rjv 



(i-n)v = [k r + -(i- v )k r _] 



kl^ 



r}k % _ 



k™ + fc^j 



-Vk. 
(1 - r/)k r _ 



k r _ +k r + 



-(1-V)~k r - 



• (35) 



Now we multiply the first equation by (1 — rj), the second by r\ and subtract the 
second from the first to find a closed expression for r\: 



(1 - 77)(^ - + (1 - r?) 



~r 7.?/) 1 l.iii ' — 



- V [k r + -(l- v )k r _]-v 



r k r + + (1 - n)k r _ 

+ k r _+k r + 



k™+k™ 
(1-V)k r _ 



0. 



(36) 
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Again, this formula can be inverted to obtain the copying driving e as a 
function of r\ and the other energy differences: 



77w p e- d » - (1 - r?He-T* + t?(1 - tj) [u P e e *{l - e~ d ") + 



)( 



*0«*)(CT+CJ*G' ! *) / 



ure e 



e d +o;* e e 



(37) 

The velocity is straightforward to calculate from one of the expressions in 
( 35 ) , and is simply: 



-kl 



n 



k™ ki + T]k i : 



(38) 



Finally, we calculate the entropy production by summing the six contribu- 
tions of the local fluxes of the system. This results in the following lengthy 
expression: 



kS = (kk™-W*k™)log 
+ (k~k™-Wk™) log 
+ (R*k r + -Rk r _) log 



' kk^ ' 


_W*k™_ 


~ kkl~ 




Wk™ 




~R*k\~ 




Rk r _ _ 





+ {W*k™ - Wk™)lo£ 
(kk r + - R*k r _) log 
■ (kk r + - Rk r _) log 



Wk v 



' kk r + 
R*k r _ 

kk 



Rk r _ 



(39) 



Dividing by k and using the expressions for r* , w* and 77, we obtain the rate of 
entropy production: 



k™ + k t 
+ {kX-V~k-)lo 



(kx + vk-)k: 
{k% + r)k™)k% 



(A: 



rjk™ 



kl + (l- r))k r _ 



+ k r _+k 
k r + + (1 - r])k 



k r _ )log 



(k r _ + k r + )k\ 



k r _ + k r + 
+ (k r + -(l-r,)~k r _)lo. 



k^-(l- V )k r _)lO; 



(k r + + (1 - ri)k r _)k r _ 

(k r + + (1 - v )k r _)k r + 



(k r _ +k r + )(l -r))k r _ 



(l-ri)k r _ 



(40) 



1.5.3 Minimization procedure and numerical results 



In analogy with the previous model, for each value of the parameters S, 5 P , 7 and 
7 P and the variable 77 we found the values of the free parameters corresponding to 
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the minimum dissipation per step. In this case we had to minimize with respect 
to four free parameters: Li p , e p , uj* and e*. Given the number of parameters, we 
implemented a larger logarithmic minimization step, equal to 1.2. 



Energetic - Kinetic 



Kinetic - Kinetic 



10" 



10 u 
10 1 













5„ =2 




5 P=1 





icr ict 
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Energetic - Energetic 




Kinetic - Energetic 



10" 




Fig. 8: Study of the four possible combination in the proofreading model with an 
intermediate state. In the left panels the copy is energetic; in particular 
we chose 6 = and 7 = 5. Conversely, in the right panels the copy is 
kinetic with 6 = 5 and 7 = 0. In the top panels the proofreading scheme 
is purely kinetic (j p = 0) , while in the bottom panel we fixed 6 P = and 
varied j p . 



The result of Fig. ([8| are consistent to those of the previous model, see Fig. 
The only important difference is: 

• The presence of an additional step in the copying pathway allows for error 
reduction via an energetic copy - kinetic proofreading scheme. This can 
be seen in the top left panel of Fig. ([8|, where the minimum error does 
depend on 6 P via the usual function f("f)f(6 p ) ~ f("f + 6 P ). This is at 
variance with model 3, shown in Fig. ([6]), where the minumum error in 
the same case was simply equal to f(-f). 
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